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Abstract 

The aim of this study was to evaluate the Enterobacterial Repetitive Intergenic Consensus (ERIC-PCR) as a tool for molecular 
typing of C. pseudotuberculosis isolates from eight different hosts in twelve countries. Ninety-nine C. pseudotuberculosis field 
strains, one type strain (ATCC 1 941 0 T ) and one vaccine strain (1002) were fingerprinted using the ERIC-1R and ERIC-2 
primers, and the ERIC-1 R+ERIC-2 primer pair. Twenty-nine different genotypes were generated by ERIC 1-PCR, 28 by ERIC 2- 
PCR and 35 by ERIC 1+2-PCR. The discriminatory index calculated for ERIC 1, ERIC 2, and ERIC 1+2-PCR was 0.89, 0.86, and 
0.92, respectively. Epidemiological concordance was established for all ERIC-PCR assays. ERIC 1+2-PCR was defined as the 
best method based on suitability of the amplification patterns and discriminatory index. Minimal spanning tree for ERIC 1+2- 
PCR revealed three major clonal complexes and clustering around nitrate-positive (biovar Equi) and nitrate-negative (biovar 
Ovis) strains. Therefore, ERIC 1+2-PCR proved to be the best technique evaluated in this study for genotyping C. 
pseudotuberculosis strains, due to its usefulness for molecular epidemiology investigations. 
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Introduction 

Corynebacterium pseudotuberculosis is a Gram-positive, facultative 
intracellular bacterium, classified into two biotypes based on host 
preferences and nitrate-reducing activity. It is the causative agent 
of several infectious and contagious chronic diseases, including 
caseous lymphadenitis (CLA), ulcerative lymphangitis, mastitis, 
and oedematous skin disease (OSD), in a broad spectrum of hosts 
[1-5]. It is a common pathogen of sheep, goat and horses. It also 
causes sporadic infections in other species including cattle, buffalo, 
camels, llamas and humans [1—9]. 

In sheep and goats, C. pseudotuberculosis is etiological agent of the 
caseous lymphadenitis, predominantly caused by the nitrate- 
negative biovar Ovis strains [5] . This disease is widely distributed, 
with high prevalences in Australia [10], Brazil [11-13] and 
Argentina [14], where it is responsible for significant economic 
losses in wool, milk and meat production. In horses and water 
buffalos, C. pseudotuberculosis infection is responsible for ulcerative 
lymphangitis or chronic abscesses and edematous skin disease, 
respectively, being in both cases mainly caused by the nitrate- 
positive biovar Equi strains [3,4]. Whereas, in cattle, C. 



pseudotuberculosis infection can be caused by both biovars and 
produces three clinical forms, cutaneous, mastitic and visceral, 
being the two last less common [1,15,16]. 

A great variety of DNA-based methods have been used for 
determining genotypes in individual isolates of C. pseudotuberculosis, 
including enzyme restriction of chromosomal DNA [17,18], 
ribotyping [18-20], polymerase chain reaction - restriction 
fragment length polymorphism (PCR - RFLP) [21], Pulse-Field 
Gel Eletrophoresis (PFGE) [22] and Random Amplified Polymor- 
phic DNA (RAPD) [23]. However, these techniques have revealed 
high genetic homogeneity within the species. This could reflect the 
clonal-like behavior of this pathogen or limitations in the methods 
used for strain identification. 

Recently, our group proposed a typing method based on 
enterobacterial repetitive intergenic consensus-PCR (ERIC-PCR), 
which was shown to be a good test for genetic discrimination of C. 
pseudotuberculosis field strains from sheep and goats, with high 
resolution, repeatability and typeability [24,25]. However, all 
previously typed isolates belonged to biovar Ovis and came from 
Brazil. Thus, the present study aimed to evaluate ERIC-PCR as a 
tool for molecular typing of C. pseudotuberculosis isolates from eight 
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different hosts (buffalo, camel, cattle, goat, horse, human, llama 
and sheep) in twelve countries (Argentina, Australia, Belgium, 
Brazil, Chile, Egypt, England, France, Israel, Kenya, Scodand, 
and USA). 

Materials and Methods 

Bacterial Strains 

Ninety-nine C. pseudotuberculosis field isolates, C. pseudotuberculosis 
ATCC 1 94 1 0 T type strain and C. pseudotuberculosis 1002 vaccine 
strain were selected for genotyping by ERIC-PCR. These were 
representative strains selected from the collection of the Labor- 
aterio de Genetica Celular e Molecular, Institute de Ciencias 
Biologicas, Universidade Federal de Minas Gerais, which receives 
strains from the most active laboratories on C. pseudotuberculosis 
research in the world. Moreover, they represent important C. 
pseudotuberculosis strains for the hosts or countries they were isolated 
in, as many of the bacteria used in the study have been selected for 
genome sequencing (CIP 52.97, PATIO, 119, FRC41, Cp31, 
Cpl62, Cp267, Cpl002 and CpC231) [9,26-32]. Information on 
host, country of origin and biovar of the C. pseudotuberculosis field 
isolates is summarized in the Table 1 . This collection of strains 
includes several isolates from animals and a single isolate from a 
human, that were identified by routine phenotypical tests [5] , and 
their species identification was confirmed by phospholipase D 
(PLD) PCR [33]. Among the 101 studied strains, 27 belong to 
biovar Equi and 74 to biovar Ovis (Table 1). 

ERIC-PCR 

C. pseudotuberculosis genomic DNA was extracted as previously 
described [33]. The strains were fingerprinted by ERIC-PCR 
using the primers ERIC-1R (5'-ATGTAAGCTCCTGGG- 
GATTCAC-3'), ERIC-2 (5 ' -AAGTAAGTGACTGGGGT- 
GAGCG-3') and the ERIC-1R+ERIC-2 primer pair (Invitrogen, 
Carlsbad, CA, USA) [34] as previously described [24]. Briefly, the 
PCR reaction was performed using 50.0 mM Tris, 1.5 mil 
MgC12, 10.0 mM KC1, 50.0 mM (NH4)2S04 (pH 8.3) (Phoneu- 
tria, Brazil); 0.2 mM of dNTP (each) (Life Technologies, USA); 
2.0 mM of each primer (Life Technologies, USA); 2.5 units of Taq 
DNA polymerase (Phoneutria, Brazil) and 100.0 ng of template. 

Data Analysis 

Band size estimates and genotype analyses were done using the 
software BioNumerics 6.5 (Applied Maths, Sint-Martens-Latem, 
Belgium). Clustering analysis was performed with the same 
software based on the Dice similarity coefficient and the 
unweighted pair group method with arithmetic mean (UPGMA). 
The Hunter and Gaston Diversity Index (HGDI) was calculated 
[35] for ERIC 1, ERIC 2, and ERIC 1+2. The typeability was 
evaluated from the proportion of isolates that were scored in the 
ERIC-PCR assays and assigned a type [36]. The three ERIC- 
PCR assays were also classified based on amplification pattern by 
evaluation of resolution of DNA amplification bands, average 
number of bands per genotype and presence of smearing. 

The minimum-spanning trees (MST) were generated using the 
UPGMA to calculate the distance matrix Prim's algorithm 
associated with the priority rule and the permutation resampling 
[37,38]. The MST presented is the top score tree, the tree with the 
highest overall reliability score. Clonal complex term is frequently 
used in MLST (Multilocus Sequence Typing) analysis to describe 
patterns of evolutionary descent and is defined as a group of 
genotypes that share a minimum of 5 (a total of 7) (71.42%) loci 
[37]. Due to the great difference in resolution between the 
techniques (MLST vs ERIC-PCR), we determined, for ERIC- 
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PGR analyses, to use clonal complex for a group of strains that 
share 100% of similarity on genotype analysis. However, for 
dendrogram analysis the clusters were classified based on ~80% of 
similarity (ERIC 1 - labeled A-K; ERIC 2 - labeled A-I; ERIC 1 + 
2 - labeled A-K). 

Statistical Analysis 

The global agreement among the three techniques was 
calculated evaluating the number (n) of different genotypes in 
each assay of ERIC-PCR, per host or per country, using the 
nonparametric Kendall's W statistic [39,40] with the aid of R 
software version 2.9.0 (R Development Core Team, Vienna, WIE, 
Austria). 

Results 

ERIC-PCR Genotyping 

ERIC-PCR was able to fingerprint and assign a type to all the 
101 C. pseudotuberculosis strains studied from different hosts and 
geographic origins. For all ERIC-PCR assays, the previously 
described genotypes were assigned the same identification label 
used by Guimaraes et al. (2011) [24] and Dorneles et al. (2012) 
[25], and the new genotypes were labeled sequentially in the same 
way (Figures 1,2, and 3). 

Among all studied strains, ERIC 1-PCR resolved 29 genotypes 
(Figure 1), ERIC 2-PCR 28 genotypes (Figure 2), and ERIC 1+2- 
PCR 35 genotypes (Figure 3). The HGDI calculated for the ERIC 
1-, ERIC 2- and ERIC 1+2-PCR were 0.89, 0.86, and 0.92, 
respectively. For ERIC 1-PCR, among the 29 genotypes resolved, 
15.85% (6/29) had already been described for C. pseudotuberculosis 
strains isolates from sheep and goats of the Brazilian States of 
Minas Gerais and Pernambuco [24,25], whereas the new 
genotypes constituted 84.15% (23/29) of all typed C. pseudotuber- 
culosis. The El. 3 7 genotype was the most prevalent one containing 
27.72% (28/101) of the tested strains, followed by El. 30 that 
occurred in 11.88% (12/101) of the strains. The average number 
of bands observed in genotyping by ERIC 1-PCR was 5.60, 
ranging ranged from 168.4 bp to 1,578 bp. Four bands were more 
frequent in 67.32% (68/101) of the strains in this assay. The 
molecular weights of the four frequent bands were approximately 
168.4 bp, 342.4 bp, 589.2 bp and 652.7 bp. 

Of the 28 genotypical profiles resolved by ERIC 2-PCR, five 
had been previously described [24,25], whereas the genotypes of 
90.09% (91/101) of all studied strains were new. The E2.38 
genotype was the most prevalent one representing 25.74% (26/ 
101) of the strains, followed by E2.37 with 20.79% (21/101) and 
E2.40 with 14.85% (15/101) of prevalence. Four bands were most 
frequent in 80.19% (81/101) of all strains. The molecular weights 
of the frequent four amplicons were approximately 195.1 bp, 
378.8 bp, 430.84 and 666.5 bp, while all observed amplicons 
ranged from 195.1 bp to 1,290 bp. 

Of the 35 genotypes determined by ERIC 1+2-PCR, only five 
had been previously reported [24,25], whereas the genotypes of 
90.09% (91/101) of all studies strains were not previously 
described. The most prevalent ERIC 1+2 genotypes were 
E12.41 and E12.34, which were observed in 18.81% (19/101) 
and 17.82 (18/101) of all tested strains, respectively. The average 
number of bands observed by genotypes in this assay was 7.35. 
The molecular weights of the frequent fragments were 104.6 bp, 
391.8 bp, and 621.9 bp and they were found in 81.18% (82/101) 
of tested strains, while all PCR products by this reaction ranged 
from 94 bp to 1,282 bp. 



ERIC-PCR Clustering Analyses 

In the ERIC 1-PCR dendrogram, cluster A was composed by 
one sheep strain from Argentina and another camel isolated from 
England, and cluster B consisted of one sheep and one goat strains 
from Sao Paulo State, Brazil. Cluster C was composed by one 
horse and one goat strains from Chile and Brazil, respectively, and 
cluster D by two USA horse isolates. The isolates grouped in 
cluster E (85.1%) originated from 27 sheep (23 from Brazil, two 
from Argentina, one from Australia and the ATCC 194 10 1 
reference strain), 27 goats (26 from Brazil and one from USA), 26 
buffalo (from Egypt), four cattle (from Israel), one horse (from 
USA) and one llama (from England). Each of clusters G, H, I, J 
and K were represented by a single isolate from buffalo (Egypt), 
sheep (Brazil), horse (Scotland), goat (Brazil) and human (France), 
respectively. 

In ERIC 2 cluster analysis, cluster A was composed by one 
horse (Scotland) and one goat (Brazil) isolate, while cluster B was 
composed by a single human isolate, and cluster C of two isolates; 
one from buffalo (Egypt) and one from sheep (Brazil). Cluster D 
included four genotypes (five strains), originated from four goats 
(Brazil) and one sheep (ATCC 19410 T reference strain) strains. 
Cluster E comprised 84. 1 % of typed strains and was composed by 
27 sheep (23 from Brazil, three from Argentina and one from 
Australia), 22 goats (2 1 from Brazil and one from USA), 24 buffalo 
(from Egypt), four cattle (from Israel), six horses (three from USA, 
one from Kenya, one from Belgium and one from Chile), one 
llama (from USA) and one camel (from England) strains. Clusters 
F and G included one sheep and one goat strain, both from Brazil, 
whereas clusters H and I were formed by two sheep (Brazil) and 
two buffalo (Egypt) strains, respectively. 

In the ERIC 1+2 dendrogram, cluster A consisted of three 
sheep and three goats strains from Brazil, two buffalo isolates from 
Egypt, beyond the reference strain ATCC 19410 T . Clusters B and 
C were composed by a single isolate, one goat (from Brazil) and 
one human (from France) isolate, respectively. Cluster D was 
formed only by sheep isolates, all from Brazil. Clusters E and F 
had most of the genotyped strains. Cluster E was composed by 1 1 
sheep (eight from Brazil, two from Argentina and one from 
Australia), 22 goat (21 from Brazil and one from USA), seven 
buffalo (from Egypt), three cattle (from Israel) and one llama (from 
USA). Cluster F was composed by six sheep (five from Brazil and 
one from Argentina), six horse (three from USA, one from 
Belgium, one from Kenya and one from Chile), 17 buffalo (from 
Egypt), one catde (from Israel) and one camel (from England). 
Cluster G was formed by two sheep and one goat strains both from 
Brazil. Finally, cluster H was composed by two Brazilian goat 
isolates, whereas cluster I, J and K included one buffalo (from 
Egypt), one sheep (from Brazil), and one horse (from Scotland) and 
one sheep (from Brazil), respectively. 

The Kendall's W coefficient of concordance observed among 
the three techniques of ERIC-PCR was 0.982 (P = 0.00438) when 
the results were grouped by host species and 0.991 (P = 0.00059) 
when they were grouped by country of origin. ERIC 1+2-PCR 
showed higher HGDI and better consistency, complexity and 
performance of DNA amplification than other assays tested. 

Clustering Patterns of Biovar Ovis and Equi Strains 

A MST was created based on ERIC 1+2-PCR fingerprint 
(Figure 4). The MST revealed the existence of three major clonal 
complexes that clustered around nitrate-negative (Ovis) and 
nitrate-positive (Equi) strains, although no specific genotypic 
profile was observed for C. pseudotuberculosis nitrate-positive and 
nitrate-negative strains by ERIC 1+2PCR. Corynebacterium pseudo- 
tuberculosis biovar Equi strains, with the exception of two strains 
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Figure 1. Cluster analysis by ERIC-PCR fingerprint (ERIC 1) of 99 C. pseudotuberculosis field isolates, 1002 vaccine strain and ATCC 
1 941 0 T type strain. Clustering analysis was performed with aid of BioNumerics 6.5 (Applied Maths, Sint-Martens-Latem, Belgium) and based on the 
Dice similarity coefficient and the unweighted pair group method with arithmetic mean (UPGMA). Eleven major clusters labeled A-K were defined 
from groups of closely related strains sharing on average —80% of genotype similarity. The numbers in parentheses correspond to the number of 
isolates within the genotype. 
doi:1 0.1 371 /journal.pone.0098758.g001 
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Figure 2. Cluster analysis by ERIC-PCR fingerprint (ERIC 2) of 99 C. pseudotuberculosis field isolates, 1002 vaccine strain and ATCC 
1 941 0 T type strain. Clustering analysis was performed with aid of BioNumerics 6.5 (Applied Maths, Sint-Martens-Latem, Belgium) and based on the 
Dice similarity coefficient and the unweighted pair group method with arithmetic mean (UPGMA). Nine major clusters labeled A-l were defined from 
groups of closely related strains sharing on average —80% of genotype similarity. The numbers in parentheses correspond to the number of isolates 
within the genotype. 
doi:1 0.1 371/journal.pone.0098758.g002 
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Figure 3. Cluster analysis by ERIC-PCR fingerprint (ERIC 1+2) of 99 C. pseudotuberculosis field isolates, 1002 vaccine strain and ATCC 
1 941 0 T type strain. Clustering analysis was performed with aid of BioNumerics 6.5 (Applied Maths, Sint-Martens-Latem, Belgium) and based on the 
Dice similarity coefficient and the unweighted pair group method with arithmetic mean (UPGMA). Eleven major clusters labeled A-K were defined 
from groups of closely related strains sharing on average —80% of genotype similarity. The numbers in parentheses correspond to the number of 
isolates within the genotype. 
doi:1 0.1 371 /journal.pone.0098758.g003 
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isolated from buffalo, exhibited a pattern of clustering (cluster F), 
as well as observed for biovar Ovis strains, which were 
predominant in all other clusters but cluster F (Figure 4 panel 
A). All biovar Ovis strains grouped together with biovar Equi 
strains were sheep isolates, most from Sao Paulo State, Brazil, and 
one from Argentina. 

Discussion 

Corynebacterium pseudotuberculosis has been considered an excep- 
tionally homogeneous species, showing a low genetic diversity 
among isolates by several different molecular typing assays [17- 
23]. However, our recent results show that the amplification of 
DNA between successive repetitive intergenic consensus elements 
through ERIC-PCR is a good method for molecular character- 
ization of C. pseudotuberculosis strains isolated from sheep and goat, 
with great discriminatory power and typeability, besides the good 
repeatability [24,25]. In this study, our findings were broaden by 
characterizing with ERIC-PCR assays (ERIC 1, ERIC 2 and 
ERIC 1+2) a very diverse population of C. pseudotuberculosis isolates 
from both biovars, Ovis and Equi, including strains from eight 
different hosts (buffalo, camel, cattle, goat, horse, human, llama 
and sheep) isolated from twelve countries (Argentina, Australia, 
Belgium, Brazil, Chile, England, France, Egypt, Israel, Kenya, 
Scotland, USA) (Table 1). 

Molecular typing methods are commonly used to investigate 
epidemiological relationships among isolates and sources of 
infection. However, before being used for those purposes, PCR 
methods for molecular typing require careful in-house validation 
of typeability, reproducibility, repeatability, stability, discrimina- 
tory power and epidemiologic concordance [36,41]. Since 
Versalovic et al. (1991) [34] evaluated the ERIC-PCR technique 
for eubacteria, the method has been successfully applied for 
genotyping of different microbial pathogens, including gene 
mapping, detection of strain diversity, population analysis, 
epidemiology, and the demonstration of phylogenetic and 
taxonomic relationships [42]. Our data showed that, as reported 
in our previous studies with C. pseudotuberculosis isolates from 
Brazilian sheep and goat, the ERIC-PCR (all assays) showed high 
typeability, with all strains being fingerprinted and assigned a type. 
Moreover, the assays presented a high discriminatory power as 
shown by the high HGDI indexes observed. These high 
discriminatory power and typeability, associated with its high 
repeatability [24], supports the use of ERIC-PCR as a good 
molecular typing technique also for C. pseudotuberculosis strains. 

Following the in-house ERIC-PCR validation, based on that 
neither laboratory nor epidemiologic evidence is definitive by 
itself, but each one validates the other [43], the epidemiological 
concordance of C. pseudotuberculosis genotyping by this method was 
established. Since, strains epidemiologically related (from same 
origin) exhibited identical genotypes (all ERIC-PCR), which in 
some cases were also identical to genotypes previously described 
for C. pseudotuberuculosis isolates from the same region [25]. Of the 
six epidemiologically related strains (331, 336, 445, 446, 447 and 
453), for ERIC 1-PCR, four had genotypes identical to previously 
described ones, and for ERIC 2-PCR and ERIC 1+2-PCR the 
number of strains that had identical genotypes was four and five, 
respectively. 

Moreover, of the types assigned by each ERIC-PCR, the 
majority (84.15% ERIC 1; 90.09% ERIC 2; 90.09% ERIC 1+2) 
correspond to new types not yet reported, which is consistent with 
the present sampling that is composed by only 6.06% (6/99) of 
field strains epidemiologically related with the previous samples 
[25]. The great number of novel genotypes observed may be the 



result of differences between hosts (sheep, goat, buffalo, horse, 
catde, camel, llama and human), origin (Argentina, Australia, 
Belgium, Brazil, Chile, England, France, Egypt, Israel, Kenya, 
Scotland, USA), isolation year (1952 vs 2009) or evolutionary 
changes. 

We also found that epidemiological concordance of the ERIC- 
PCR genotyping of C. pseudotuberculosis reflects epidemiological 
links observed in the formation of the sheep flock of Minas Gerais 
State, what corroborates our previous findings [24]. Data (2008/ 
2009) from the state agency for animal health (Instituto Mineiro de 
Agropecuaria - IMA) showed that there was a large transit of 
sheep from different states of Brazil (Distrito Federal, Espirito 
Santo, Goias, Rio de Janeiro, Sergipe, Sao Paulo) into Minas 
Gerais, contributing to the constant growth of the state sheep 
population. Thus, validating the epidemiological data and vice- 
versa, some genotypic profiles found by Guimaraes et al. (2011) 
[24] for C. pseudotuberculosis sheep isolates from Minas Gerais State 
were identical to genotypes observed for C. pseudotuberculosis sheep 
isolates from Pernambuco [ERIC 1 (El. 3), ERIC 2 (E2.1) and 
ERIC 1+2 (E12.22)] and Sao Paulo States [ERIC 1 (El.l; E1.4) 
and ERIC 2 (E2.8)]. 

Regarding the three ERIC-PCR techniques used in this study, 
we had already previously shown that they were highly concordant 
among themselves, i.e., the genotypic differences observed by one 
of the techniques is very similar to differences observed by the 
others. This was also observed in the present study from a large 
sample of C. pseudotuberculosis isolates from different hosts and 
countries, which show high Kendall's W coefficient of concor- 
dance. Thus, considering that the genetic diversity observed by 
ERIC-PCR assays are much alike, we selected two other 
parameters, HGDI and suitability of the amplification patterns, 
to define the best typing assay among the three ERIC-PCR assays 
for molecular characterization of C. pseudotuberculosis strains. The 
HGDI is a widely used index suitable to compare different typing 
systems [35]. DNA amplification results, which evaluates the 
consistency, complexity and performance of an amplification 
system, also allows the comparison among DNA-fingerprinting 
methods. Based on these parameters, we found that the ERIC 1+ 
2-PCR is the best assay among the ERIC-PCR tested, since this 
assay presented the highest HGDI and a suitable amplification 
pattern providing more distinct DNA amplification bands, a good 
average number of bands per genotype and less smearing (data not 
shown). ERIC 2-PCR also demonstrated the same amplification 
characteristics of ERIC 1+2 primer set, however presented a low 
HGDI, despite having shown the best index in previous studies 
[24,25]. In contrast, ERIC 1-PCR showed the lowest HGDI in all 
our studies [24,25]. Furthermore, ERIC 1-PCR presented in all 
studies a less distinct and outnumbered band pattern. Versalovic 
et al (1991) [34] also observed that the primer ERIC-1 alone 
yielded limited amplification products. In addition, when the 
cluster analysis was based on similarity greater than 80%, 
classification by ERIC 1-PCR or ERIC 2-PCR depicted a large 
cluster composed by ~85% of the strains. 

One of the first techniques proposed to type C. pseudotuberculosis 
was biotyping, which divided the isolates in biovar Ovis and Equi, 
chiefly associated with strains isolated from sheep and horses, 
respectively [5]. Phenotypic characteristics can be linked to 
genotypes [42] , as it was described for the nitrate-reducing ability 
related to different restriction patterns and ribotypes [17-19], 
whereas no genetic pattern between nitrate-positive and nitrate- 
negative C. pseudotuberculosis strains was depicted by ERIC-PCR. 
However, MST data analyses showed that, despite no association 
of nitrate reduction capability and ERIC-PCR genotypes, there 
was a clustering of isolates with similar results on nitrate-reduction 
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Figure 4. Minimal spanning trees (MSTs) by ERIC 1+2-PCR of 99 C. pseudotuberculosis field isolates, 1002 vaccine strain and ATCC 
19410 T type strain. A) Clonal complexes according to biovar of the strains. B) Clonal complexes according to host origin of the strains. C) Clonal 
complexes according to country origin of the strains. The MST presented is the tree with the highest overall reliability score and were calculated using 
the UPGMA associated with the priority rule and the permutation resampling using Bionumerics 6.5 (Applied Maths, Sint-Martens-Latem, Belgium) 
[37,38]. The number within of node indicates the cluster observed in dendrogram analyses (Figure 3). The numbers between two neighboring ERIC 1 + 
2-PCR types indicate distance between them. The sizes of the nodes depend on the number of strains (their population size). Wedges in circles 
represent the proportion of C. pseudotuberculosis isolates from respective sources. 
doi:1 0.1 371 /journal.pone.0098758.g004 

test (Figure 4 panel A). Biovar Equi strains, with few exceptions, predominant in all other clusters but cluster F (Figure 4 panel A), 
clearly clustered together (cluster F), while biovar Ovis strains were Interestingly, all biovar Ovis strains grouped in cluster F were 
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sheep isolates, most from Sao Paulo State, Brazil, and one from 
Argentina. The differences in the clustering pattern of biovar Ovis 
and Equi strains reflects the great number of genes not shared by 
both biovars, as complete genome analyses of 1 5 C. pseudotubercu- 
losis strains showed that biovar Ovis contain 314 orthologous genes 
that are shared by all strain from this biovar but are absent from 
one or more strains of biovar Equi [44]. Furthermore, biovar Equi 
strains have 95 core genes that are absent from one or more strains 
of biovar Ovis [44] . 

The MST analysis also revealed the existence of three major 
clonal complexes, from which other clonally related isolated 
groups emerge, one in cluster F, the other in cluster E and another 
in cluster A (Figure 4). These inferences may become useful to 
develop a model for evolutionary steps in the difference of the C. 
pseudotuberculosis ERIC 1+2-PCR genotypes, nevertheless, more 
representative sampling is needed for inclusion into this model for 
a more robust comparison. 

Associating the MST analysis with geographical or host origin is 
difficult because most of strains were isolated from a particular 
host belong to the same country (buffalo, goat and catde), or 
because some strains are not well represented in the sample 
(camel, llama and man). However, the analyses of segregation of 
the strains with respect to geographical origin and host distribution 
among clusters determined by MST showed that, in spite of being 
the most heterogeneous geographical group (Table 1), all horse 
isolates were grouped in cluster F, as well others biovar Equi 
strains. One exception was a horse isolate from Scotland that was 
typed as biovar Ovis. This segregation pattern determined by 
biovar was also observed for water buffalo isolates, all from Egypt, 
which presented different clustering patterns: most of the biovar 
Equi isolates were in cluster F, whereas biovar Ovis isolates were 
grouped in clusters A, E and I, together with biovar Ovis isolates 
from other regions. Water buffalo isolates 45 and 49 were 
exceptions by being classified as biovar Equi and grouped in 
cluster A. Cattle strains also presented similar clustering patterns, 
with biovar Ovis strains being grouped into cluster E and biovar 
Equi strain grouped in cluster F. For buffaloes and cattle, which 
are not the main hosts of C. pseudotuberculosis, this biovar-clustering 



of isolates could be related to the host those species had acquired 
the infection from, since biovar Equi strains are closely associated 
to horse infection and biovar Ovis strains are mostly isolated from 
sheep or goat infection, representing, respectively, the usual cause 
of disease in horses and in sheep and goats [5] . Thus, it may also 
explain the clustering of camel, llama and human C. pseudotuber- 
culosis isolates. 

Sheep and goat C. pseudotuberculosis strains were grouped into 
several clusters and consequently were spread through different 
clonal complexes in MST. As most of the sheep and goat studied 
isolates (>90%) were from Brazil, those diverse types and 
clustering were probably due to the increase of the Brazilian 
sheep (14.8%) and goat (57.8%) commercial herds in recent years 
(1995 to 2006) [45], with intense traffic of animals sold for 
breeding and formation of new herds. Considering that the 
majority of sheep and goat isolates were from the Brazilian States 
of Bahia and Sao Paulo, respectively, it is noteworthy that Sao 
Paulo State had an increase of 86. 1 % in sheep herd population 
between 1995 and 2006, and that Bahia recorded a significant 
increase (1 1.3%) in its goat herd, the largest in the country, during 
the same period [45] . Furthermore, this growth in Brazilian sheep 
and goat herd population, due to an expansion of markets in all 
regions, was based in the importation of animals from various 
countries, mainly South Africa and Europe, which may have 
favored the entry and spread of different C. pseudotuberculosis strains 
and could also explain the genotypic similarity of isolates from 
Brazil with strains from those regions. 

In conclusion, ERIC 1+2-PCR proved to be a good technique 
for genotyping of C. pseudotuberculosis strains, due to its usefulness 
for molecular epidemiology investigations. 
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